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METHOD AND SYSTEM FOR DETECTING, 
TIMING, AND CORRECTING IMPULSE NOISE 



FIELD OF THE INVENTION 

This invention is related generally to receiving apparatus for a data commvinication system. 
More particularly, this invention is directed to a method and system for detecting, timing, and 
correcting impulse noise on a data communication channel. 

BACKGROUND OF THE INVENTION 

Communications systems are used to transfer data between remote transmitting and receiving 
locations over one or more physical channels. The integrity and reliability of such data 
transmissions can be compromised by several factors, such as transmission imperfections, 
receiving imperfections, and physical channel interference. Examples of interference which 
affect the quality and rate of data transmission include impulses (or bursts), and other short 
duration events, and long duration interference, such as ingresses. A relatively large effort has 
been directed to improving the transmitter and receiver robustness for long duration interference. 
However, compensating for impulse^urst noise has received little attention. Impulse or burst 
noise can be caused by many uncontrollable events, such as arcing and electrical transients 
coupled through a power system, lightning, etc. This type of noise typically occurs at 
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unexpected times and lasts for a relatively short period of time (on the order of several 
microseconds). Because it is broad band in effect, impulse noise is of particular concern in 
multi-carrier transmission schemes since the impulse noise may afifect many, or even all channels 
simultaneously. 

5 For example, in a multi-tone modulation transmission scheme, a number of carriers 

positioned at different frequencies are used and data is transmitted simultaneously in parallel 
over the carriers. Each band carries a fraction of the total information being transmitted. The 
discrete bands or sub-channels are independently modulated, and each has a carrier frequency at 
the center frequency of the particular band. For example, multi-tone modulation schemes 
10 include DMT (Discrete Multitone modulation), used for DSL channel communication, and 
^2 VCMT (Variable Constellation Multi Tone), suggested for use in HFC CATV applications, 
iil In one DMT configuration, a 1.1 MHz chamel is broken down into 256 sub-channels or 

III bands, each of which is 4 KHz vdde. Each sub-channel has its own carrier frequency. Each of 
PJ the sub-channels is used to transmit a fixed number of information bits in a single symbol or 
iai5 signal period. The DMT system monitors the signal to noise ratio for each of the sub-channels 
h and uses this information to determine how many bits per signal period (symbol) may be carried 
Jf? in each of the sub-channels. The number of bits per signal in a sub-channel is typically referred 
S to as the loading of the sub-channel. The DMT system dynamically adjusts the loading of each 
r;' of the sub-channels in accordance with the noise characteristics of the sub-channel. If a sub- 
20 channel is particularly noisy, it may not be used at all. 

VCMT modulation is a transmission scheme specifically designed to effectively combat 
the high ingress and impulse/burst impairments present in cable TV channels, and also to 
maximize the data throughput capacity of such channels. Like DMT, data is transmitted over 
multiple sub-channels, typically 36. However, in VCMT, data is transmitted using a variable bit 
25 loading per tone, along with coding and interleaving. The VCMT system measures the noise 
present on each channel and mdependently modulates the tone data transmission scheme from 
QPSK (quadrature phase shift keying) to 256-QAM (quadrature amplitude modulation) 
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accordingly. Dviring operation, the signal-to-noise ratio across the channels are monitored for 
each tone and the headend receiver instructs the upstream transmitter in the cable modem to 
modify the QAM constellation for each tone to maintain a desired bit error rate. VCMT also 
uses spectral shaping to reduce the frequency sidelobes of the tones to thereby reduce the effect 
5 of narrowband interference to only the closest tones. 

Multi-tone transmission schemes have relatively good noise immunity. However, there is 
a tradeoff when resistance to impulse noise is considered. Because data is transmitted on 
multiple channels, the symbol length can be relatively long while still providing a high overall 
data transmission rate. Both DMT and VCMT have relatively long symbol periods. Thus, each 
10 has fairly good immunity (compared to a single carrier signal) from weak time domain events, 
such as impulses and bursts, because the effect of a short time domain event v^U be averaged out 
ill over the relatively longer symbol period. However, strong impulses are very damaging to DMT 
Sl transmission because the impulse may simultaneously corrupt all of the 256 symbols occupying 
^ the 256 DMT sub-channels. Similarly, strong impulses are also very damaging to VCMT 
Q15 transmissions because the impulse can corrupt the symbols occupying every VCMT sub-channel 
3=i at the time the impulse hits. In addition, because VCMT uses spectral shaping, neighboring 
ifj symbols may also be corrupted by the impulse. 

i|l Impulses can be corrected by either canceling or blanking. Impulse canceling is a 

!=: process where the impulse waveform is estimated and then subtracted from the data signal at the 
20 appropriate time. Impulse blanking is a process in which the impulse is located in time and the 
data input is zeroed (or blanked) for the duration of the impulse. Canceling is superior to 
banking because it preserves the underlying data. However, it is considerably more complex to 
implement. 

Various techniques for compensating for impulse noise are known. U.S. Patent No. 
25 5,410,264 to Lechleider, which is incorporated herein by reference, describes very basic 
techniques for impulse detection, timing, and cancellation. The techniques are based on match 
filtering with a known impulse shape. An impulse is detected by using a match filter and 
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monitoring when the filter output energy crosses a defined threshold. The location of the 
impulse is estimated to be the single match filter output sample which has the maximum energy. 
However, the techniques disclosed in the '264 patent are incapable of handling impulses having 
more than one degree of freedom (e.g., spanned by a basis of more than one vector). 
5 U.S. Patent No. 5,479,440 to Esfahani, which is incorporated herein by reference, 

discloses an impulse cancellation technique which compensates for out-of-band impulse noise by 
filtering out frequency components suspected as impulses. However, the '440 patent does not 
adequately address in-band impulse noise. U.S. Patent No. 5,703,904 to Langberg, which is 
incorporated herein by reference, discusses temporarily inhibiting (e.g., blanking) modem 

10 adaptation circuits when an impulse is detected. However, impulses are detected only by using a 
5 simple thresholding input signal. Further, no attempt is made to cancel the impulse. A 

11 somewhat more detailed impulse detection scheme, using an adaptive threshold, is disclosed in 
U.S. Patent No. 5,119,321, which is incorporated herein by reference. 

1^? However, none of these prior art impulse detection and compensation schemes provides a 

SI 

^ robust and efficient impulse detection system which can adequately detect impulses which have 
n two or more unknown (or varying) degrees of freedom. Further, none discloses an efficient 

scheme for handling strong impulses in a multi-channel transmission system, 
jfj Accordingly, it is an object of the present invention to provide a method for 

H compensating for strong impulse interference in a muhi-carrier system, such as DMT and 
20 VCMT. 

It is a ftirther object of the invention to provide an improved technique for detecting and 
timing impulses which have attributes that span a number of degrees of freedom. 

Yet another object of the invention is to provide a system which will reliably detect an 
impulse, determine its attributes, and then take appropriate corrective action. 
25 Yet a ftirther object of the invention is to provide an impulse detection and correction 

system which is suitable for in band impulse noise as well as out-of-band impulses. 
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A further object of the invention is to detect and correct non-stationary noise, i.e., noise 
which has statistics which change over time. 

SUMMARY OF THE INVENTION 

This invention is directed to novel methods and apparatus for detecting and correcting 
5 impulse noise on a data communication channel. In one embodiment of the invention, a dual- 
detection scheme is used. The communication signal, which can be a single carrier signal or a 
multi-carrier signal, is input to two or more separate impulse detectors, each of which uses a 
different impulse detection scheme. Preferably, complementary detection schemes are used such 
that the weaknesses in the first detection scheme are countered by the strengths in the second 
kQ detection scheme, and vice versa. 

'0 xhe outputs of the impulse detectors are provided to an impulse correction enable circuit 

ffl which uses the impulse detection signals to determine when an impulse is present and produces 
^ an output indicating that corrective action should be taken. Preferably, more than one corrective 
y action can be taken and the enable circuit further indicates what corrective action to take. The 
Is received signal is also provided to the impulse correction circuit, such as a canceling / blanking 
hJ circuit. In accordance with the oxitput of the enable circxiit, the impulse correction circuit 
RJ corrects for impulse noise by, e.g., subtracting an estimated impulse waveform from the data 
O signal at the appropriate time or by blanking the data signal for the duration of the impulse. The 
output of the impulse correction circuit can be sent directly to subsequent components, such as a 
20 decoder. Preferably, and according to another aspect of the invention, the output of the pulse 
corrector is provided to a reliability estimator and selector. The original (uncorrected) data 
signal is also input to the selector. The selector compares the corrected and uncorrected input 
signals and determines which is more reliable. The most reliable signal is then selected and 
provided to subsequent components. 
25 Various impulse detection schemes can be used. One preferred and novel impulse 

correction scheme is based on utilizing the high intertone impulse correlation generated when an 
impulse hits a multitone symbol. When a short impulse hits a multitone symbol that is longer 
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than the impulse duration, the noise phasors inflicted on neighboring tones are very similar. This 
similarity can be quantified by computing the cross correlation among neighboring tones, which 
would thus be close in absolute value to 1 (the phase depends on the timing of the impulse within 
the symbol). The high cross correlation is utilized for estimating impulse timing and for 
5 estimating the impulse noise waveform. 

In particular, the new impulse detection / correction scheme uses quiet tones (sub 
channels) among the multiple channels in the data link wherein a fraction (typically a few 
percent) of the sub channels are not used for data transmission. According to this aspect of the 
invention, the signals on the quiet tone channels are uncorrupted by any transmit signal and 
10 therefore can be used to accurately detect the presence of an impulse. Because of high cross 

0 correlation of impulse noise across all channels, an UBpulse detection in a quiet-tone can be used 
m to indicate an impulse on other channels as well. Most preferably, impulses are detected and 
%l their timing estunated based on a Gram Schmidt orthogonalization process. 

01 xhe new Gram Schmidt scheme provides superior impulse timing and shape estimation. 
il|5 However, it is less accurate when multiple impulses are present in a short time period. 

Accordingly, when used in a dual-detection scheme, a complementary detection scheme which 
more accurately detects multiple impulses should be used. Preferably, a detection scheme based 
Jfi on a signal energy moving average window is used. 

^ In addition to detection of impulses, new methods of canceling impulses are also 

20 presented, including a linear least square scheme, and a decision feed back scheme. 

The new technique for detecting and canceling strong impulses can be used in 
conjunction with multitone transmission schemes, such as DMT and VCMT, to solve their 
vulnerability to impulse noise. However, the new method is not limited to multitone 
modulations, but also is suitable for other types of communication schemes, such as CDMA and 

25 single carrier modulation, and even non-communication signals. Furthermore, by applying the 
present method but switching the model time and frequency axes, the invention can be used to 
acquire the frequency of fast rising or changing ingresses (narrow band interference) and cancel 
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them accordingly. In addition, in some data transmission and reception schemes, received and 
amplified signals may be subjected to clipping. The effect of clipping on a signal is similar to 
that produced by impulse noise. The present invention may also be adapted to detect and cancel 
noise generated by clipping, thus permitting the use of less complex and expensive receiving 
5 circuits. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other features of the present invention will be more readily apparent 
from the following detailed description and drawings of illustrative embodiments of the 
invention in which: 
=10 FIG. 1 is a block diagram of an impulse detector and corrector; 

'fi FIG. 2 is a detailed block diagram of one embodiment of the impulse detector and 

ill corrector of Fig. 1; 

J^: FIG. 3 is a block diagram of a Gram Schmidt-based impulse detector; 

H FIG. 4 is a block diagram of a moving window impulse detector; and 

A5 FIG. 5 is a block diagram of a decision feedback impulse cancellation module. 

!l! DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

^11 Turning to Fig. 1, there is shown a block diagram of an impulse detector and corrector 

r: system 10 according to the invention. The received input communication signal 11, which can 
be a single carrier signal or a multi-carrier signal, is input to an impulse detector 8, which 
20 preferably comprises two or more separate impulse detectors, 12, 14, each of which uses a 
different impulse detection scheme. The outputs of the impulse detectors 12, 14 are provided to 
an impulse correction enable module 16, The signals produced by the impulse detectors 12, 14 
indicate when an impulse has been detected and may contain information about the detected 
impulse itself The correction enable module 16 uses the information contained in the impulse 
25 detection signals to determine when an impulse is present and signals via its output that impulse 
correction is required. 
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In the preferred embodiment, the outputs of two or more separate impulse detectors are 
used to determine the timing and possibly shape of an impulse. Thus, the results are generally 
more accurate than if a single impulse detector is used. Further, the use of multiple detectors 
permits the use of complementary detection schemes, wherein weaknesses inherent in a first 
5 detection scheme are countered by the strengths in a second detection scheme and vice versa. 
Preferred detection schemes are discussed in detail below. 

Impulse correction can be by cancellation or blanking. Preferably, both methods are 
available. In such a situation, the output of unpulse detector 8 indicates whether the impulse 
should be canceled or blanked. When two or more impulse detectors 12, 14 are used in impulse 
10 detector 8, the selection of which correction method to use is made with consideration as to the 
■? accuracy of the impulse detection schemes in use and how well the impulse waveform can be 
Hi predicted in various circumstances. If the impulse correction enable circuit 16 determines that an 
a I accurate waveform can be generated, an indication to cancel the impulse is output. Otherwise, 
5^5 an indication to blank the impulse is output. 

iJIS An impulse corrector module 18 is provided which receives the input signal 11, pre- 

n processed by a match filter 68 if necessary, and the output from the impulse detector 8 
!!! (correction enable 16). In accordance with the signal output firom the impulse detector, the 
Id impulse correction module 18 corrects for the detected impulse noise by, e.g., subtracting an 
rf estimated impulse waveform fi-om the data signal at the appropriate time or by blanking (zero- 
20 ing) the data signal for the duration of the impulse. The output of the impulse correction module 
18 can be sent directly to subsequent components, such as a decoder. Preferably, however, the 
output is provided to a reliability estimator and selector 20, which also receives as input the 
original (uncorrected) data signal, pre-processed by a match filter 72 if necessary. Selector 20 
evaluates the corrected and uncorrected data signals to determine which is more reliable. The 
25 most reliable signal is then selected and provided to subsequent components. As can be 
appreciated, the dual-output path can be implemented using a variety of impulse detectors. 
Preferably, however, a dual-path impulse detector, such as described below, is used. 
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Tuming to Fig. 2, there is shown a detailed block diagram of one embodiment of the 
dual-path impulse detector and corrector 10 of Fig. 1. In this embodiment, a novel Gram 
Schmidt-based impulse detection scheme which detects impulses and estimates their timing 
primarily through the use of quiet tones in a multi-tone transmission is used for the first impulse 
5 detector 12. A moving window threshold impulse detection scheme that detects impulses by 
thresholding the signal energy of the entire signal plus interference is used for the second 
impulse detector 14. These impulse detectors are configured for use in data transmission 
schemes, such as DMT and VCMT, which use multiple data transmission sub-channels, and are 
discussed in more detail below with reference to Figs. 3 and 4, respectively. 

10 Also shown, although not technically part of the detector and corrector 10, is an analog 
y and digital front end comprising an A/D converter 22 and a decimator 24. The received analog 

11 data signal is sampled by A/D converter 22 at a rate sufficient to prevent aliasing of the data 
Sl signals. Because of the difficulty of implementing high roll off analog filters, the sampling rate 
ffl is usually considerably higher than required by the Nyquist Rate. To reduce the samplmg rate to 
m a value closer to the Nyquist frequency, the sampled signal is decimated by decimator 24 to 
U produce the input signal 11 which is subsequently processed by the impulse detector and 
W corrector 10. The front end may also contain standard timing and frequency loops. These are 
!f| well-understood components to those skilled in the art and will, thus, not be described in detail. 

The Gram Schmidt impulse detection method produces data which is suitable for 
20 estimating the waveform of a detected impulse. The output of detector 12 is used by an impulse 
estimator 19 which estimates the impulse waveform. This information is used later for impulse 
cancellation, when appropriate. 

To further improve the accuracy of one or both of the impulse detectors, timing 
information from analogous detectors operating in neighboring signal bands may be shared and 
25 merged. Because impulses are typically wide enough to cover many adjacent receiver bands, 
sharing and merging impulse detection information from multiple detectors operating on 
adjacent bands can further improve the accuracy of the detection. For example, as shown in Fig. 
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2, the output of neighboring band impulse detectors analogous to detector 12 is provided via 
TO A interface 12 to a module 13 which merges the impulse time of arrival (TO A) determined by 
detector 12 and its neighbors. The weight to be given to the neighboring band impulse detection 
information when merging this information is dependent on the degree to which impulses are 
5 expected to cross multiple bands. Various merging methods will be apparent to those of skill in 
the art. This feature is discussed further below. 

In addressing the operation of the various impulse detectors and other aspects of the 
invention, it is useflil to define various terms and parameters. The signal input to the system is 
comprised of the Xmit signal (transmitted from remote transmitter), stationary noise sources (e.g. 
10 white noise or narrow band noise), and impulses. From the point of view of the impulse 
5; detection and correction mechanism, the impulse is the signal of interest while the Xmit signal 
m and stationary noise sources are the imwanted interfering signals. These unwanted signals are 
Sii referred to herein as the background noise. 

jH It will be assumed that impulses to be detected have an expected length of L. If the 

1315 detector 10 is used when impulses of indeterminate length are expected, the methods for 
h detecting impulses of a known length L discussed below may be repeated with each possible L. 
T In addition, the following parameters will be used in the following discussion of impulse 
In detection and correcting schemes, as well as in the mathematical appendices: 
F:^ NTones - Number of tones (sub-channels, carriers) in the multi-carrier communications 

20 channel 

Mgs - Gram Schmidt matrix of size (L * L), time dependent for DMT (see 
derivation in appendix) 

Mls - Least square impulse estimation matrix of size (L * L), time dependent for 

DMT 

25 THgs - Gram Schmidt window energy threshold. 

THmw - Moving window energy threshold. 

THera - Erasure threshold used by the reliability detector 
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Tg - The size of group on which local maximization is performed as part of 
Multiple Detection Suppression 

NN - The number of neighbors used when merging impulse detection TOA 

information from other bands 
5 Af - The carrier frequency difference between successive tones 

f(n) - The impulse response of the whitener 30 (fig 3) 

Gk - gain at center frequency of tone K when viewing f(n) in frequency domain 
y(n) - Signal at output of whitener 30 (fig 3). 

a - The background noise power after being whitened by whitener 30. 

10 a - The memory of the decision feed back impulse canceler 

Iri Turning to Figs. 2 and 3, the impulse detection using the new Gram Schmidt-based 

J5 detection will now be discussed. According to one aspect of the invention, the input signal 1 1 is 
IP initially processed to accentuate impulses which may be present relative to other signals on the 
1^5 NTones sub-channels being considered. The preprocessing makes use of quiet tones, e.g., sub- 
% channels which are not being used to transmit data, and therefore only contain noise from 
CI stationary sources. More particularly, the input signals are processed by a whitening filter 30 
!JJ which equalizes the noise in the input signal and produces an output y(n). The whitening filter 
30 has an impulse response f(n), that provides a gain of Gk at the center fi-equency of tone K. 
M The gains Gk are selected to whiten the input background noise (input to 30), as a result the noise 
20 at output of whitener 30 is primarily a a power white Gaussian noise. Various whitening 
schemes are known to those of skill in the art. 

The sub-channel(s) designated as quiet tones can be selected arbitrarily. However, there 
are performance differences between different quiet tone arrangements, as will be appreciated by 
one of skill in the art. One way to assess the merits of a certain arrangement of quiet tones is to 
25 plot the combined impulse response of the whitener 30 and whitener match filter 32, (i.e., 
f(n)*P(-n)). In a preferred quiet tone arrangement, the combined impulse response at the middle 
of the impulse is maximized while the impulse response at other locations is minimized. 
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Coefficients of the whitener filter 30, as well as the coefficients of the detection matrix 
Mgs generated for use in the impulse detection routine, and discussed below, are generated in 
accordance with dynamic measurements of the noise power spectrum of the input signal. The 
background noise is preferably continuously measured and the relevant matrices and filters 
5 updated accordingly. Because changes in background noise are relatively slow, the matrix 
update rate can be low. Background noise measurement can be done using conventional 
techniques, such as averaging of the per-tone-energy of sheer errors, where the sheer error is the 
difference between the sheer input and output. 

Data signals are not transmitted on the quiet tone sub-channels. Thus, these sub-channels 

10 (in the absence of an impulse) contain only the stationary noise sources. The power spectrum for 
O these channels will therefore be considerably lower than that on the sub-channels used to 

11 transmit data (i.e., those with the higher "backgroimd noise") and so, during the whitening 
!i= process, the data carrying sub-channels will be attenuated far more than the quiet tone sub- 
Ki channels. As a result, when an impulse is present, it will have a much larger effect on the power 
ii|5 level in a whitened quiet tone when compared to the power level of the sub-channels which are 

carrying data tones. This effect is graphically illustrated in the input and output graphs adjacent 
lil the whitener filter 30 shown in Fig. 3 . 

JfJ The output of the whitener filter 30 is passed through a whitener match fiher 32. This 

F' filter has a transfer function which is the complex conjugate of the whitener, i.e., f*(-n) and 
20 produces from input y(n) an output equal to y(n)* f * (-n). The whitener match fiher 32 

functions to collect the energy from an impulse into a short interval having duration similar to 

the impulse. 

The whitener filter 30 and whitener match filter 32 are preferably implemented as 
frequency-domain multiplications, although time convolution can be used instead. For 
25 frequency-domain, a filter bank is used that analyzes the input to the whitener 30, and then 
applies a per-band gain for the whitener, then a per-band gain for the match filter 32, and finally, 
combines all bands into one wider signal, e.g., by using a filter bank. The per-band whitener 
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gain and the per-band whitener match filter gain can be combined into one gain (i.e., multiplied) 
to reduce complexity. Efficient implementation of a filter using filter banks is well-known to 

those skilled in the art. 

Following the whitener and whitener match filters 30, 32, the processed signal is fed to 
5 the Gram Schmidt Impulse TOA estimator 34, shown in detail in Fig. 3. First, the processed 

signal is fed to a FIFO 40 having a length L equal to the estimated length of the impulse. Each 

time a new sample enters FIFO 40, the oldest sample is discarded. The output of FIFO 40 is a 

parallel column vector of height L. 

Next, the vector output of the FIFO 40 is left multiplied by a Gram Schmidt matrix Mgs 
1 0 of size (L * L) in Gram Schmidt module 42. The output of Gram Schmidt multiplication 42 is a 
Cl column vector of height L, where each component of the vector represents the projection of the 
lij impulse on one of the Gram Schmidt (orthogonal) basis vectors. In general, multiplication by 
[ij matrix Mgs transforms values that are the results of correlation with the match filter (i.e., f^(-n)) 
61 into projections on the Gram Schmidt basis vectors. The specific form of matrix Mas is 
i4|5 dependent on the type of data transmission scheme at issue. Detailed derivations of the Gram 
%, Schmidt matrices for VCMT and DMT are presented in Appendixes A-C, below. Similar 
Ul analyses can be used to derive the appropriate Gram Schmidt matrix for other data transmission 
Ifi schemes, as will be imderstood by those of skill m the art in view of the present disclosure. In 

addition, while the appendixes present a derivation of some implementations of various aspects 
20 of the present invention, the invention is not limited to an implementation which adopts the 

precise mathematical relationships discussed in the appendixes and various adjustments to the 

parameters and mathematical relationships may be made while still falling within the overall 

scope of the invention. 

The column vector output of Gram Schmidt module 42 is processed by Energy 
25 Summation module 44. Summation module 44 sums up the square values of the coordinates of 

the input vector. This value represents the (estimated) net impulse energy after being filtered by 
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the whitener filter for the impulse samples which are contained in FIFO 40. Thus, the value can 
be used to determine whether an impulse is present and, if so, when it occurred. 

As an artifact of this impulse detection scheme, the impulse energy may be smeared 
across several time points. In particular, a genuine impulse will generate a high-energy sample 
5 at the output of energy summation 44 which has a temporal location corresponding to the TOA 
of the impulse. However, and in addition, the impulse will generate lower energy values at small 
distances from the actual (correct) TOA of the impulse. This leaked energy is usually 
significantly lower energy than the energy present at the impulse's TOA. To avoid erroneous 
multiple detections, a Multiple Detection Suppression module 36 is used. 
10 In a preferred embodiment, Multiple Detection Suppression module 36 comprises three 

sub-modules as shown in Fig. 3. Group divider 46 divides the vector input into several non- 
Ill overlapping groups of size To and identifies the maximum energy point in each group. These 
|j group maxinuims are provided to a Local Maxima Detector 48. Energy values that are higher 
F! than a previous group and not lower than a subsequent group are passed to Threshold Local 
05 Maxuna module 50. (Only immediate neighboring groups are considered. This is the usual 
h Local Maxima Detection). Finally, local maximas that are higher than a predetermined energy 
i! threshold, THgs, are identified. The temporal position of those identified maximas corresponds 
Ji to the TOA of an impulse. It should be noted that the final TOA is not the timing of the local 

maxima achieving group, but the timing of the maximum sample within that group. 
20 Thus, multiple detection suppression module 36 acts to remove the false alarms and pass 

on only the true impulse detection using local maximization and assuming that the true detection 
energy is higher than the surrounding energy. The size To determines the size of the area in 
which local maxunization is performed. The Local Maxuna Detector 48 assures that two 
impulse detections are at least To apart. It can be appreciated that increasing Tg reduces false 
25 alarms but also reduces the probability of genuine impulse detection. (In cases when two 
genuine impulses occur in an interval shorter than Tq samples, one of the impulses would be 
discarded by tiiis process.) Those of skill in the art will recognize that there are a variety of other 
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techniques for multiple detection suppression, each having various tradeoffs between false alarm 
rates, probability of impulse detection, and complexity. 

This impulse detection scheme can be modified in several ways. One variation is 
removing the Gram Schmidt multiplication (in module 42). This can easily be accomplished by 
5 setting Mqs to the identity matrix. As a result, the energy summation will compute the energies 
in the output vector of FIFO 40 directly. This reduces performance but also reduces complexity 
since Mgs does not need to be computed. An alternative option is available if a good prediction 
of the impulse power is available in advance. If so, this information can be incorporated into 
matrix Mqs. 

10 As discussed above, and with reference to Fig. 2, there are cases where a channel is 

O divided into multiple successive bands, each of which is utilized by a different set of modems, 
ill Typically the same transmitter and receiver hardware is used, but located at different non- 
JiJ overlapping bands. For example, in an HFC channel, a 35MHz upstream channel (5-40MHz) 

may be divided into successive 1.6MHz bands. On each HFC band, there are multiple 
115 transmitters (e.g., the cable subscribers) and a single receiver at the Head End (cable central 
% office). In many channels, impulse noise has a wide enough bandwidth to affect multiple 
^Jl successive modem bands. Advantageously, the timing of impulses on each band is detected 
Jo independently, and then the multiple detection information is combined to obtain a refined 

estimation. 

20 In the preferred embodiment, a local TOA estimation is output from detector 12 and 

which, when a Gram Schmidt detector is used, includes sample energy found by Local 
maximizer 48 after multiple detection suppression. This data is broadcast to neighboring bands 
via the TOA interface 12'. At the same time, neighboring band receivers also broadcast their 
multiple detection suppressed TOA's (including sample energy found by their respective local 

25 maximizers). The TOA interface 12' receives the TOA's of neighboring bands and sends them 
to the TOA Merge module 13 which combines this data with the local TOA estimation. 
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In a preferred implementation, the TOA estimates are combined using a weighted 
average, where weight is related to the TOA sample energy. There are other possible TOA 
merge algorithms, with tradeoff being between estimation noise and complexity. For example, 
the merge may be based on a median computation on all the TOA's, where the sample weight is 
5 interpreted as a TOA repetition used for the median computation. 

When TOA information from multiple bands is merged, it is possible to vary the 
distribution of quiet tones in ways not possible without this data sharing. In one embodiment, 
the quiet tones are distributed evenly among different bands, in which case there will be similar 
TOA estimations of similar reliabilities from all bands. In another embodiment, the quiet tones 

10 can be distributed unevenly. In an extreme case, one band may host all of the quiet tones, and all 
^=f other bands may use its estimation directly. 

11 Turning to Fig. 4, there is shown a block diagram of a moving energy window impiilse 
51! detector, here implemented as impulse detector 14. As shovra, the output of decimator 24 (which 
j^J is not considered part of the detector) is fed to FIFO 60 having length L. Each time a new 
05 sample enters the FIFO 60, the oldest sample is discarded. The contents of the FIFO 60 are 
JU, output in parallel as a vector of length L. An energy summation module 62 receives the FIFO 60 
Ul output and generates the sum of the square values of the coordinates of this vector. The energy 
Jfl values received from energy summation module 62 are compared to THmw , the moving window 

impulse threshold by thresholding module 64. All sample times comprising windows having 

20 energy higher than THmw are considered to contain an impulse. 

A variety of other ways of implementing a moving window impulse detector can also be 
used. The particular implementation used is largely dependent on the implementation 
technology (e.g. DSP, ASIC etc.). For example, a moving window of length L may be 
implemented using an accumulator. The accumulator is first set to the sum of energies (squares) 

25 of L consecutive samples, giving the initial moving window a value. Then, each time the 
moving window is advanced, one sample (in time), the energy of the oldest sample is subtracted 
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&om the accumulator and the energy of the new sample is added, thus the accumulator energy 
then corresponds to the new set of samples. 

A disadvantage of the moving window detection scheme is significantly less accuracy in 
impulse timing estimation vs. the new Gram Schmidt process, discussed above. The moving 

5 window is capable of detecting nearby impulses and estimating their timing only if the impulses 
are significantly strong compared with the Xmit signal, since the Xmit signal is an interfering 
noise relative to impulse detection. The Gram Schmidt scheme, by primarily relying on the quiet 
tones where the Xmit signal does not interfere, achieves superior performance. The background 
noise, and not the Xmit signal strength, limits the Gram Schmidt performance. In most cases of 

1 0 interest, the background noise is much weaker than the Xmit signal. 

S However, the movmg window impulse detector is good at detecting situations where 

Wl multiple impulses occur close together while the Gram Schmidt timing detection is not accurate 
IJ! in detecting closely spaced impulses. If the vahie of Tg is small, a high false alarm rate will 
F! result. If; on the other hand. To has a large value, there will be a large number of missed impulse 
Cis detections because only one impulse is assumed to occur within Tq. Because the strengths and 
1=1 weaknesses of these two impulse detection schemes complement each other, they are well 
H! matched for implementation in the dual-detector scheme shown in Fig. 1 . 
3 As shown in Figs. 1 and 2 and discussed above, in one embodiment of the invention, two 

H methods of correcting impulses ~ cancellation and blanking ~ are implemented. In order to 
20 determine which type of correction scheme to use, the detection signals from the two impulse 
detectors 12, 14 are evaluated by an impulse correction enable module 16. As noted previously, 
impulse cancellation refers to an estimation of the impulse signal, i.e., by Gram Schmidt 34 
(timing only) coupled to an impulse estimator module 19, and subsequent subtraction of the 
estimated impulse from the data received signal at summer/blanker 66 (Fig. 2). If the estimate is 
25 accurate, cancellation removes the impulse from the received signal altogether. The Gram 
Schmidt 34 TOA estimation coupled with an esthnated waveform from impulse estimation 
module 19 are capable of estimating and canceling only short and separated (in time) impulses. 
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(The impulse duration and impulse separation supported by Gram Schmidt depend on the 
nvimber of quiet tones and on their placement). 

In contrast, impulse blanking refers to zeroing the match filter input at summer/blanker 6 
for the duration of the impulse as determined, e.g., by the output of the moving wuidow 
5 threshold detector. Blanking removes the impulse noise but introduces a new impairment 
because of the Xmit signal zeroing (for the duration of the impulse). Signal zeroing manifests 
itself by loss of Xmit energy and, more importantly, by inter-symbol interference (ISI) and inter- 
tone interference for multitone (ITI). This penalty makes blanking advantageous only for those 
impulses having an instantaneous power which is strong, when compared with Xmit signal 

10 carrier power, making recovery of the Xmit signal unlikely. Moreover, a moving window TO A 
3 estimation is effective only for strong impulses, making blanking useful with strong impulses 
M only. In general, it is better to use cancellation if possible. 

In the preferred embodiment, where the complementary Gram Schmidt and moving 
^? window impulse detection schemes are used, the correction enable module 16 is configured to 
16 determine whether to use Impulse Cancellation or Impulse Blanking based on the relative 
strengths and weaknesses of these methods. If an impulse is detected by Moving Window but 
f 1 not Gram Schmidt, then blanking is used. If an impulse is detected using Gram Schmidt but not 

11 Moving Window (as when the impulse is too weak for Moving Window detection), then 
cancellation is used. If an impulse is detected by Gram Schmidt at a first TO A, and an impulse 

20 is detected using Moving Window in the vicinity of that TO A, then, if the moving window 
detections coincide with the first TOA, cancellation is used. Otherwise, if the Moving Window 
detections do not coincide with the fust TOA, blanking in the detected impulse interval is used. 
Logic circuits or programs to implement this decision process can be easily generated by those of 
skill in the art. If other types of impulse detection modules are used, the operation of the enable 

25 module 16 may need to be adjusted accordingly, as is well-known in the art. In addition, the 
enable module 16 can also be configured to combine impulse detection data fi:om more than two 
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different detection schemes and act accordingly based on the relative strengths and weaknesses 

of the implemented schemes. 

Impulse blanking is performed according to the TOA of the impulse as detected by, e.g., 

the Moving Window detector. The number of samples blanked corresponds to the length L of 
5 the moving window on which the impulse was detected. More precisely, the L consecutive 

samples output from decimator 24 (Fig. 2) that were grouped by FIFO 60 and identified as an 

impulse by thresholding their energy, are zeroed by summer/blanker 66 at the input of match 

filter 68. Other samples (i.e., those which were not declared to be an impulse) are not altered. 

Various techniques may be used for canceling a detected impulse. Two alternate methods are 
10 preferred a linear Least Square (LS) cancellation scheme and a Decision Feedback (DFB) 
^"f cancellation scheme. 

m In a Least Square cancellation scheme, impulse LS Estimation 19 is configured to 

m estimate impulse coefficients in the time domain using data gathered by examining the quiet 
^ tones. The estimated impulse waveform is then subtracted by the summer/blanker 66 firom the 
05 received signal. The impulse LS estimation is performed on the L consecutive samples output of 
h whitener match filter 32 that were grouped by FIFO 40 and then identified by Gram Schmidt 
module 42. The estimation is performed by multiplying the L long sample vector from FIFO 
Ji 103 with matrix Mls (which is an L*L matrix). For reference, one derivation of this impulse 
H Least Square estimation is provided in Appendix B. Other methods knovra to those of skill in 
20 the art can also be used. 

Decision Feed Back (DFB) impulse cancellation is an iterative process that uses a slicer 
output for Tones 1...K-1 to predict impulse inflicted noise on a Tone K. The technique is 
somewhat similar to using Decision Feedback for ingress (narrow band interference) cancellation 
in single carrier systems (but swapping time and frequency domains) because once the impulse 
25 timing is available from the Gram Schmidt process, the problems of multitone impulse 
cancellation and single carrier ingress cancellation become similar. Despite its relative 
complexity, the advantages of DFB over least square cancellation are numerous. 
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Far fewer quiet tones are needed for DFB impulse cancellation versus LS cancellation. 
In Least Square Cancellation, the accuracy of the estimation degrades as the quiet tones 
frequency separation is increased since the cross-correlation of impulse inflicted noise between 
tones decays as tones are separated in frequency. Thus, the wider the Xmit signal bandwidth as 
5 compared with the reciprocal of impulse duration, the more quiet tones are needed. In contrast, a 
DFB impulse canceler refines the impulse estimation, going from Tone K to Tone K+1, and 
therefore has no such limitation. In fact, if the background noise is ignored, even a single quiet 
tone for DFB cancellation can be suitable for accurate results. A further advantage is that there 
is lower leftover impulse estimation noise after cancellation. The impulse estimate is refined 
10 from one tone to the next. Thus, the final estimate is based on many more sample tones than are 
examined in Least Square cancellation. 

01 When DFB impulse cancellation is implemented, the impulse TOA stimation, e.g., as 
III output from the merge module 13 is provided directly to DFB module 70 (Fig. 2). In one 
^ embodiment, the summer/canceler 66 does not need to subtract an impulse estimation (such as 
ds from LS estimation) or blank. Rather, it just passes the signal on to match filter 68. The output 

of match filter 68 is fed to the DFB module 70 which does the actual impulse cancellation. 
Jfl Other arrangements are also possible. 

2 Fig. 5 is an illustration of one embodiment of DFB 70. For simplicity, it will be assumed 
H that there is only one quiet tone and it is the tone having the lowest frequency. The per-tone 

20 complex values (i.e., (I,Q) s) output of match filter 68 are provided as input 69 to DFB 68 in 

increasing order of tone frequency. (These complex values are composed of Xmit signal plus 

impulse noise plus background noise.) 

Adder 73 subtracts a current impulse prediction (which is output of rotator 88) from the 

complex value of the input 69, thus removing its impulse noise component. The cleaned 
25 complex value output of the adder 73 is then provided to sheer 74 which finds the nearest valid 

constellation point (i.e., slices). The output 76 of sheer 74 is then fed to a reliability estimator 94 

(Fig. 1, 2), discussed below, and eventually to the decoder 100. 
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Assuming that the slicer 74 had made no error, the true Xmit signal is provided to adder 
78 which subtracts the true Xmit signal from the complex value input at 69 (and transferred to 
adder 78 input along signal 80), v^here the input at 69 is composed of Xmit signal plus impulse 
noise plus background noise. This leaves only the impulse noise plus background noise at the 
5 output of adder 78. This background noise corrupted impulse estimation is input to Predictor 82. 

The Predictor 82 multiplies its input by (1- a ) at 84. The adder 86 sums the previous 
predictor output from delay buffer 90 multiplied by a at 92 with the multiplied output at 84. 
The coefficient a is the memory of the predictor, e.g. a=^l means ignore current sample (use only 
memory), while a=0 means ignore the past samples (no memory). When the rotator 88 is 
10 included, the result is essentially a single pole passband filter. The properties of rotator 88 
S determine the center frequency of the passband filter. The rotator frequency co (in radians) 
M corresponds to the impulse timing x from 50 multiplied by frequency difference Af between 
Ji' successive tones, i.e., (0=27z Af t. 

SI The impulse estimation needs to be rotated from Tone K to Tone K+1 because the tones 

'Is have different carriers. Since the tones have different carriers, the same is true for their match 
5 filters in match filter 68, and the match filters directly rotate the impulse (a part of the filtering 
m operation). Also note that for the quiet tone itself, the output of the slicer 74 is set to zero since 
f{ nothing was transmitted on that tone. 

In the event that there is only one quiet tone, but it is the tone at the highest frequency, 
20 the same process can be used, but starting from the highest frequency tone and going dovm to the 
lowest frequency tone. In this case, the rotator frequency (0= -2n Af t. In case there is a quiet 
tone that is not located in one of the extreme tones, but is somewhere in the middle, the DFB 
cancellation can be divided into two parts. First, the DFB can propagate from Quiet tone up and 
handle all higher frequency tones, and then can propagate dovm from Quiet tone to all lower 
25 tones. If there is more than one quiet tone, each tone can be used to cleanup its neighboring 
tones. 
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There are many ways known to those of skill in the art to construct the predictor 82 of 
DFB canceler 70, trading off complexity and accuracy. For the sake of simplicity, a rather 
simple first order predictor is preferred. Alternatively, for example, an N th order FIR predictor 
may be calculated directly from impulse noise cross-correlation among different tones. If the 
5 impulse is modeled as the gating on of a white gaussian noise source for the duration of the 
impulse, the random signal consisting of impulse samples on different tones is a stationary 
process having a sink shaped cross-correlation (among tones). The use of stationary random 
signal cross-correlation to build predictors is known to those of skill in the art. Also, using the 
equality (l-a)X+aY-X+a(Y-X), the one pole predictor 82 may be constructed using one 
10 multiplier and two adders instead of two multipliers and one adder. This transformation may 
% reduce complexity for some implementations. 

While the system and method of the preferred embodiment selects whether to blank or 
ill cancel a detected impulse, alternately the system can be simplified so that impulses are always 
!:{ blanked or always canceled. It is also possible to combine blanking with canceling using 
95 decision feedback (DFB canceling). For very strong impulses, some benefit can be achieved by 
U first blanking the signal and then performing DFB canceling. When the impulse is blanked, so is 
!fj the Xmit signal. This introduces an interference similar to an impulse having instantaneous 
J| power identical to Xmit signal power. If the impulse power is significantly higher than the Xmit 
tl signal, then blanking it would reduce the burden on the Decision Feedback impulse canceler. 
20 Because Decision Feed Back canceling uses a prediction based on earlier tones to clean the 
current tone, it is limited by the impulse process innovation from earlier tones to the current tone. 
If the innovation is too large, sheer errors will occur. Blanking, by reducing this innovation for 
large impulses, extends the impulse canceler operating range. 

Referring back to Figs. 1 and 2, and according to another aspect of the invention, two 
25 output demodulation paths are used - a corrected path which passes through the impulse 
corrector 18, and an uncorrected path which preserves the original input signal. This dual-path 
structure can compensate for false alarms and timing errors generated by the impulse correction 
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module 18 and is used to alleviate the conventional tradeoff between increasing probability of 
detection and increasing false alarms. For example, to reduce false alarms in the above 
described impulse detection system, the group size To and/or the THgs and THmw should be 
increased. However when To , THqs , or THmw are increased, the detection rate is reduced. 
5 Thus, it is advantageous to permit the impulse detection to generate false alarms and to correct 
them later. 

In this architecture, the corrected and the uncorrected signal paths are input to reliability 
estimator and selector 20. Selector 20 selects the best signal among these two. As will be 
appreciated by one of skill in the art, by using the double path architecture, even when a false 

10 alarm or timing error is present on the "corrected" path, the output to the decoder will be from 
the uncorrected data path and therefore there is no harm due to the false alarm. The strength of 

m the two path architecture is based on the accuracy of the Reliability Estimators 94, 96 (Fig. 2) 
% used in the estimator and selector module 20. Preferably, the reliability estimators 94, 96 are 

accurate enough to also generate reliable erasures which are subsequently used by the decoder. 
^315 The reliability estimators 94, 96 operate by slicing the respective input signal and 

JL, comparing the input of the slicer to its output. The closer the input signal is to the interpreted 
W output, the more likely the input signal is to be reliable. More specifically, each one of the 

11 NTones input tones is sliced, thus producing a vector of NTones Decisions. For tones where the 
F' Xmit signal is known, i.e., quiet tones (where the Xmit Signal is 0) and preamble tones (where 
20 the Xmit signal taken from a known preamble sequence), the known value is used in the slicer's 

decision vector. Then, the per tone Decision is subtracted from the per tone Slicer input to 
produce a difference vector. (If the various tones do not have the same margin vs. background 
noise, it is preferred to scale them according to their individual margins.) Then, the sum of 
squares of the difference vector is computed to produce the reliability metric. The reliability 
25 metric has units of energy, wherein the higher the metric the less reliable the data. 

Path Selection module 98 uses the generated Reliability Metric for two purposes. First 
the reliability metrics for the corrected and uncorrected data are compared and the input having 
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the lower metric (i.e., the more reliable) is selected. Then, preferably, the reliability of the 
selected path is compared to a threshold value, THera. If the reliability metric exceeds THera, 
then even the most reliable path is deemed too unreliable to use and the entire column is erased 
so that the decoder gets erasures instead of channel symbols for the columm. 
5 While dual architectures have been described, both for impulse detection (Gram Schmidt 

and Moving window detectors), and for impulse correction (corrected and uncorrected path). It 
is also possible to use only a single impulse detector and/or a single path architectiH^e to reduce 
complexity (at the price of a possible reduction in performance). 

In the impulse detection and correction techniques discussed above, it was assumed that 
10 the impulse/burst duration L is given. In cases where this duration is not known, the processes 
described herein may be repeated with each possible L. Because the computations for different 
Hi impulse durations are highly overlapping, a great deal of complexity may be saved by combining 
m certain elements and steps. For example, the whitener and whitener match filter need not be 
^ duplicated because the same processing is performed for all durations. The suppression and 
fll5 merge with other bands can also be combined for all durations. 

'L. It can also be appreciated that there may be cases were the impulses are weak enough 

W such that it is possible to modulate on some tones with a reduced bit loading even without 
In canceler/blanking (i.e., using a loading lower/worse than that obtained after impulse canceling). 
H In this case we may modulate on these tones using low loading. These tones can be considered 
20 blank tones, as opposed to quiet tones, since they will be used by blanking/canceling. The rest of 
the tones (none of which are blank tones) will use the loading obtained after applying the 
impulse canceler/blanker. These tones can be considered pure data tones. In this embodiment, 
the reception process will first demodulate the blanker tones, slice them, and then subtract the 
slicer output fi-om these tones. Assuming the slicing was successfiil, the subtraction turns the 
25 blanker tones into quiet tones that may be used with the impulse canceler^lanker. After 
applying impulse cancellation or blanking, the pure data tones are demodulated. The benefit to 
this embodiment is that it achieves the maximal loading possible for the pure data tones (i.e., the 
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loading obtained after applying the impulse canceler/blanker) while not entirely sacrificing 

bandwidth to quiet tones. 

Those of skill in the art will also recognize that the present invention is not tied to the 

particular communication scheme described, but instead may be used in addition to point-to- 
5 point communication schemes, with multipoint-to-point, point-to-multipoint, and multipoint-to- 

muhipoint schemes. In addition to multitone modulation, the invention can also be used with 

single carrier, CDMA and other modulation schemes. In non-multitone schemes, the transmitter 

should be configured to notch a few bands for use by the cancelerManker as the quiet tones. 

Standard mechanisms can be used in the transmitter/receiver to correct for these notches. 
10 This invention can be used in non-communication cases as well. For example, any signal 

G that is corrupted with impulses and has one or more quiet sub bands may be cleaned using the 
m above techniques by considering the quiet sub bands as quiet tones. 

[J: Yet another set of applications fox this invention is to cancel low dimension noise 

il contained in at most a few dimensions (i.e., noise exists in a low dimension linear subspace) 
fis from a high dimension signal where a few signal dimensions may be sacrificed to locate noise. 
'L, A low dimensionality noise is a random signal which is a linear combination of a small number 
Ul N of deterministic wavefDrms or basis functions. The waveforms are generally fixed and the 
JfJ particular coefficients for the linear combination are randomly distributed scalar variables. For 
(3 this type of noise, most of the signal energy is contained in this low dimensional space. In other 
20 words, a large portion of the noise energy is contained within a small number of basis fimctions. 
For example in the impulse case, most signal energy is contained in a few successive (in discrete 
time) delta functions. In the ingress (narrow band noise) most signal energy is contained in a 
few neighboring frequencies. In contrast, white noise is not a low dimensionality noise because 
it has the same dimension as the useful signal. Given M samples of a target signal which is 
25 contaminated by white noise, the dimension N of the white noise component is also M. Further, 
the white noise power is evenly split among all of its N=M dimensions and, therefore, the present 
invention is not well suited for detecting and canceling this type of high-dimensionality noise. 
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Low dimensionality noise can be detected and canceled by projecting the noise on an 
orthogonal basis in a manner similar to the treatment of an impulse, discussed above. An 
example is a single carrier modulation corrupted by fast popping narrow ingresses wherein the 
frequency of the ingress is not known in advance and this frequency may change from time to 
5 time. In such a case, a quiet symbol is transmitted periodically (with the period getting longer as 
the ingress becomes narrower). The quiet symbol acts as a quiet tone for use in the 
detection/cancellation process. The ingress is highly correlated between successive symbols 
similar to impulse correlation between successive tones for an impulse canceler. By using the 
above techniques but swapping the frequency and time axes, the ingress frequency and an 
10 estimate of its waveform can be determined. It will be appreciated that the ingress frequency in 
D this embodiment is the same as the impulse timing for the impulse canceler, 
m The dimension N of the matrices used in the noise detection and cancellation scheme of 

S! the invention is determined by the dimensionality N of the noise signal. Thus, the computational 

IK? ^ 

O complexity of this method increases directly with the noise dimension. Furthermore, when the 
e|5 noise is detected through the use of quiet samples, such as the quiet tones used for impulse 
detection, the number of quiet samples used is preferably at least N. Thus, the larger the 
yl dimensionality of the noise, the more of the available bandwidth which must be reserved to carry 
quiet samples instead of a data signal. In general, noise having a dimensionality of no more than 
H about three to five is well suited for detection and cancellation in accordance with the present 
20 invention, although use with higher dimensionality noise signals may be suitable in certain 
circumstances, such as when it is acceptable to cancel only a portion of a detected noise signal. 

While the invention has been particularly shown and described with reference to 
preferred embodiments thereof, it will be understood by those skilled in the art that various other 
changes in form and details may be made therein without departing from the spirit and scope of 
25 the invention. For example, the present invention can be implemented using time-domain 
techniques or frequency-domain techniques. 
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Appendix A: Gram Schmidt Derivation 
VCMT 

This section presents the derivation of the impulse timing estimation for VCMT. 

We model the input to impulse blanker/canceler (input to Noise whitener filter 30) as a 
combination of impulse noise and background noise. The impulse noise is modeled as the gating 
on and off (for the impulse duration) of a white noise source. A simplified discrete version of 
this model is 

J]a,Sin-k) 

k=0 

alternately (and more accurately) the impulse may be continuously modeled using a Fourier 
series. At this point we are not assuming any distribution on ak, as it is difficult even to 
characterize the impulse noise power. We simply treat the a^t's as free numbers. In appendix C 
we will give an alternate derivation assuming the a^s are Gaussian with known power. The 
background noise is modeled as a stationary colored Gaussian noise composed of Xmit signal 
and other noise sources. 

Noise whitener filter 30 (Figure 3) is used to whiten the background noise prior to 
impulse detection. Blocks after the whitener need to detect impulse existence using whitener 
output y(n). 

To decide for a specific timing (t), whether that time was hit by an impulse (HI) or not 
(HO), we may use the log likelihood ratio below. According to Neyman-Pearson, to minimize 
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False Alarm rate for a given Detection Rate, the LLR should be compared to a threshold THg5. 
In general HI should be declared for LLR's above the threshold TUgs and HO for LLR's below 
threshold TUqs (The higher the THgs threshold the lower the False Alarm rate). Note that the 
LLR is maximized with respect to h, which is the impulse waveform (time domain), thus 
5 performing maximum likelihood for impulse coefficients, 

LLK^ET^cAn- ^ Y^ri]^ /p(^y^t-s:t + s)IH^))] 

With, y, Signal input to whitener match filter 

Hj , the hypothesis impulse with waveform h has hit at t 

=11 Hq, the hypothesis impulse has not hit at t 

jlf) h, the impulse waveform (time domain) 

^ij s, the LLR "support", i.e., half the length of y on which LLR is computed 

ff^ However we cannot declare Hi for all threshold exceeding LLRdetect's. Typically since 

around the true timing of the impulse there would be a neighborhood of high LLRdetect's, we 
m would like to suppress the additional false detections. One intuitive way to suppress false 
=115 detections is to look for local neighborhood maximas (i.e., mountain peaks). We will now give a 

detailed explanation shoving this is at least approximately correct. 

To choose among the two candidate times {tl & t2) the more likely candidate we can use 

the LLRtimejelect below. Hypothesis Hi stands for impulse has hit at tl, and hypothesis H2 

stands for impulse has hit at t2. If LLR is positive we will declare H2 and if LLR is negative we 
20 will declare Hi. The vectors hi, h2 are the (probably different) impulse waveforms for Hi and 

H2. Note that the LLR is maximized over hi & minimized over h2, thus performing maximum 

likelihood for impulse timing and coefficients. 
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LLB^_sELECTit%t2)= Lfi{p{yitl-s:t\ + s)IH,))- ^^Lr{p{y{t2~s:t2^s)l H,)) 

With, y. Signal input to whitener match filter 
Hi, the hypothesis impulse has hit at ti 
the hypothesis impulse has hit at ti 
hi, the impulse waveforai (time domain) for Hi 
h2, the impulse waveform (time domain)for H2 

5, the LLR "support", i.e., half the length of y on which LLR is computed 
Moreover UJKtimejelect may be used to find the genuine impulse among a 
neighborhood of high LLKdetecT' This may be accomplished by computing the IA^Rtimejelect 
for each pair of times, and selecting the time topt having positive h'LRriMEjELECT vs. all 
neighbors. Note we only have to suppress close neighbors, since as we depart in time fi-om the 
impulse, the IA^Rdetect drops and will not pass the detection threshold THg^. 

Assuming the support {s) of y on which the LLR computation is performed, is big enough 
vs. neighborhood of false alarms (i.e., t7-t2), we have 

p{y{t\ - 5 : rl + s)IH^) « p{y{t2 -s:t2 + s)/H,) 
Since, as we will see below, each side of this (approximate) equality is a function of the L2 norm 
of its;; segment, and the segments in this case are almost entirely overlapping. 
Thus 

^^^TIME _SELECt(^^^^^) ~ ^^^DETECt(^^) ~ ^^^DETECt(^^) 

This (approximate) equality shows a time t having LLRdetect higher than its neighbors 
would be preferred by LLRtime_select and selected as topt. Thus, instead of directly computing 
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LLRtime select we may find local neighborhood maximas (mountain peaks) of LLRdetect that 
exceed TUgs- 

The background noise is whitened by Noise Whitener 30, which outputs a a power white 
Gaussian noise. However, the impulse is also affected by noise whitener 30. We have modeled 
an impulse of length L starting at time instant t, 



thus after passing through whitener 30 we get a smeared impulse version 

k=Q 

Using all these we get: 

C-^^v(---l\(y-K)/cT' 



Max 



2 ^ 
) 



C-exp(-^'5]L /cT 



Max 
h 



Obviously h^;,^ (i.e., the h that maximizes the LLR) cannot change the first term (in the last 
formula), it can only reduce the second term to maximize the LLR. Thus, hopt needs to get as 
close to {y„} as possible. Assuming Uk are free numbers we get that hopt is the projection of {yn) 
on the subspace spanned by 



L-\ 

k=0 
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It is known from linear algebra that the projection of {y„} on a subspace is the best 
"approximation" existing in the subspace to {y„}. Thus, assuming at"'" are the coefficients of 
Kph we would like to compute the following score for time t (Impulse time would be selected as 
the time having highest score) 

Score(t) = \\yf-\\y-J^arnn-k)f 

Since hopt is a projection ofy, it is known that y equals the following orthogonal sum (i.e., sum of 
orthogonal vectors) 

y = d - ^))® - s <7(« - k)) 

As a result we have (Pitagoras over L2 vector space) 
Thus 

Scoreit) = IE <7(« - ^)|r - 1^... r 
So we need to compute the norm ^h^^^f of the projection of on the subspace spanned by 



Y^af^f(^n-ky But to reduce complexity we do not want to compute /z^p? directly. This can be 
performed using a L*L matrix A that orthogonalizes the following basis 
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i.e., 

iFAy{FA) = I 

with F as the following (2*s+L)*L matrix 
and with L, The modeled impulse duration. 

Then applying ParsavaPs theorem, the norm lKpt\ equal to the sum of squares of 
projections of h^^^ on any orthonormal basis (which h^^^ is a member of its span), since FA 
columns constitute such a basis we have: 

Score = \\^^,f ^\\(^^^^ 

Such a triangular matrix A may be computed using Cholesky factorization on FT, Note that A is 
the matrix resulting from the Gram Schmidt process on the basis B={Fo.. Vi.]}, 

However (Fy) are L consecutive samples of the whitener match filter output 32 (Figure 
3). So we only need to additionally apply A ' and than square and sum up. The grouping into L 
sample size groups is performed by FIFO 40, the multiplication by ^ ' is performed by Gram 
Schmidt module 42 and energy summation is performed by energy summation module 44. Local 
neighborhood maxima is found by Group Divider 46 and Local Maxima detector 48. And finally 
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comparison to threshold THgs is performed by Threshold Local Maxima module 50. Note the 
fact A is triangular reduces matrix application (multiplication) complexity further. 

The description above does not refer directly to matrix A, instead it refers to matrix 
Mgs=A', which is applied by Gram Schmidt module 42 in Figure 3. The following are the steps 
for computing Mqs- Although the matrix Mgs is applied (matrix multiplication) for each t, it 
need not be recomputed (Cholesky factorization) for each t. Matrix Mgs depends only on Gk 
and L that change when channel characteristics change (background noise or Impulse duration) 
and thus change slowly. Thus, the matrix Mgs may be computed periodically by a low rate 
process. 

This periodic Gram Schmidt matrix computation task is comprised of the following steps: 
The system assesses its input noise spectrum, including both stationary noise sources and the 
Xmit signal itself Using the measured noise spectrum, it adapts the coefficients Gk so that the 
whitener filter 30, having impulse response f(n), will whiten the input noise spectrum. Then 
using f(n) we build the matrix F = [V^ V, ... F^., ] with Vk defined above (definition of basis 
B). Then using, for example, Cholesky factorization, we find Mgs such that Mf^gF' FMqs'= I 
(since McsF'FMcs'= {FMcsliFM^s') = J)- Note that the whitener filter f(n) and the whitener 
match filter P(-n) and matrixes Mgs and Mls should be updated at the same time. 

DMT 

In this section we will derive impulse timing estimation for DMT. We are using the same 
impulse and background noise modeling as for VCMT. 
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Noise whitener 30 (Figure 3) is used to whiten the background noise prior to impulse 
detection. For the case of DMT, the noise whitener can not be a linear time invariant fiher (as for 
VCMT) since the LTI filter will smear the DMT. Instead we can use the output of the DFT 
(Discrete Fourier Transform) used for DMT signal demodulation, followed by a frequency 
equalizer (i.e., appropriate gain Gk per tone). The DFT demodulates the Xmit signal and splits 
the background noise into orthogonal (uncorrelated) components. The frequency equalizer 
attenuates tones carrying stronger Xmit signal or background noise (both appearing to be noise 
for blanker/canceler). Thus, the final whitener output Y(K) is in frequency (tone) domain. 

The blocks after the whitener need to detect unpulse existence using whitener output 
Y(K). As before, the log likelihood ratio LLRdetectis used for detecting impulses. The LLR is 
maximized with respect to H(K), which is the impulse waveform (in the frequency domain), thus 
performing maximum likelihood estimation for impulse coefficients. The support s parameter is 
not needed, since for DMT the LLR is based exactly on one DMT symbol length. 



With, Y(K), Signal input to whitener match filter in frequency domain 

Hi, the hypothesis impulse having waveform H(K) has hit at t 

Ho, the hypothesis impulse has not hit at t 

H(K), the impulse waveform (in frequency domain) 

However we cannot declare Hi for all thresholds exceeding LLRdetect's. Typically, since 
around the true timing of the impulse there would be a neighborhood of high LLRdetects, we 
would like to suppress the additional false detections. As with the VCMT case we would like to 
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suppress false alarms by looking for local neighborhood maximas (i.e., mountain peaks). The 
explanation is similar to the VCMT case. Here the following equality holds exactly: 



LLR. 



TIME SELECT 



itl,t2) = LLR^^cTi^V) - LLR^^rECTif^) 



since the support of hypothesis Hj and H2 is identical. This shows that by maximizing LLRdetect 
we find a time topi that is also selected by LLRtjmejelect- 

The background noise is whitened by Noise Whitener 30, which outputs a a power white 
Gaussian noise. However, the impulse is also affected by noise whitener 30. We have modeled 



the impulse as^a^S(n-t- k), thus after passing through whitener 30 we get a scaled impulse 
version which has also been operated on by the DFT. 



With N, number of DMT tones also equal to NFFT (size of FFT) 

K, being the tone number, 0<=K< N 

Gk, being tone K frequency noise equalizer gain 

t, the sample index, within a DMT symbol (0<=t<NFFT), that the impulse started 

L, being the modeled impulse length 

Un, being the impulse coefficients prior to DFT 



Using all these we obtain: 



Max 



1 NttT-i 9 

C-exp(-^- £ \{Y{K)-H{K,t))la\') 



2 NFFT-l 

2 



C-exp(-i- £ \Y{K)lcT^) 



Max 
H 



fNFFT-\ 



c 



K=0 



NFFT-\ 



X \Y{Ki- S \Y{K)-H{K)\ 



K=0 
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As with the VCMT case, Hopt(K) (i.e., the H(K) that maximizes the LLR) needs to get as 
close to Y(K) as possible. Assuming a„ are free numbers we get that Hopt is the projection of 
{Y(K)} on the subspace spanned by basis Bt below. As it is known from linear algebra that the 
projection of {Y(K)) on a subspace is the best "approximation" existing in the subspace to 
5 {FfX:;}, this yields: 



^0 







V 


V = 


n 


n 

V 



Go-expe^O) 



,for n=t.J+L-l 



With NFFT, size of FFT, Also equal to NTones (Number of DMT tones) 

t, The sample index, within a DMT symbol (0<=t<NFFT), that the unpulse 

started 
t+L-1, Impulse stop time 

Gk, being tone K frequency noise equalizer gain 
II Note for each t we get a difference subspace span(Bt). We would like to compute the following 
=1 score for time t (Impulse time would be selected as the time havhig highest score) 

Score(t) = \\Y\f-lY-H„^,(;Of 
1 5 Since Hopt is a projection of Y, it is known that Y equals the following orthogonal sum (i.e., sum 

of orthogonal elements) 

y = (Hj®{y-Hj 
As a result we have (Pitagoras over L2 vector space) 



lYf=\\Hj\lY-Hj 
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Thus 

Scoreit) = lH„^ 



a 



So we need to compute the nonri ||^^op,(-,0|| of the projection of {Y(K)} on the subspace spanned 
by Bf But to reduce complexity we do not want to compute Hopt directly. This can be done using 
a matrix At that orthogonalizes the matrix Ft below, i.e., (F,A,y(F,A,) = / , with NFFT*L matrix 
Ft being composed of the basis vectors from Bt as column vectors, i.e.: 

With r Impulse start time 
r+L-1, Impulse stop tune 
Gk, being tone K frequency equalizer gain 

As with the VCMT case, applying Parsaval's theorem we get: 

Score(0 = \\Hj =\\iFAyYf =\\A,\F;Y)f 
Such a triangular mafrix At may by computed using Cholesky factorization on Ft 'Ft. Note that At 
is the matrix resulting from the Gram Schmidt process on the basis Bt={ Vq.. Vl-i}. 

The multiplication (F'Y) may be implemented efficiently using IFFT. Computation for all 
ranges t..t+L-l (0<=t<NFFT) within a DMT symbol may be performed all at once. First we need 
to compute the vector W such that W(K)=Y(K)Gk (note no complex conjugate is needed for Gk 
since it is real) and then compute U=NFFT*IFFT(W). As a result U(t..t+L-1) corresponds to 
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To summarize the processing perfomied for each Hypothesis test at time t: the whitener 
30 (Figure 3) needs to perform FFT on its input and multiply each tone K by Gk. The whitener 
output Y(K) goes to whitener match filter 32 that again multiplies each tone K by Gk and 
performs IFFT (the two multiplications may be combined into one multiplication by Gk). Then 
5 FIFO 40 groups the whitener match filter output into L length overiapping groups. Multiplication 
by A/ is performed by Gram Schmidt module 42 and energy summation by Energy Summation 
module 44. Local neighborhood maxima is found by Group Divider 46 and Local Maxima 
detector 48. And finally comparison to threshold THgs is performed by Threshold Local Maxima 
:i{ module 50. Note, the fact that At is triangular also reduces the multiplication complexity. 
S|0 The previous description does not refer directly to matrix At, instead it refers to matrix 

SI MGs-pAt\ which is applied by Gram Schmidt module 42 in Figure 3. We will now point out the 
iii steps for computing Mgs- As in the VCMT case, the matrices At may be computed periodically 
by a low rate process. 

2 This periodic Gram Schmidt matrix computation task is comprised of the following steps: 

'"15 The system assesses its input noise spectrum, including both stationary noise sources and the 
Xmit signal itself Using the measured noise spectrum, it adapts the coefficients Gk so that the 
whitener fiher 30, having impulse response f(n), will whiten the input noise spectrum. Then 
using Gi, G2, . . . Gnfft-1 we build the matrixes Fi, F2, . . . FNPFT-b with = [V^ V^^^ ... V^^^_^] 
and Vk defined above (definition of basis Bt). Subsequently, we need to find for each t a matrix 
20 Mosj such that M^^ t^t^Mos This can be performed using Cholesky factorization. 
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Note that the whitener and whitener match filter Gk's and matrixes Mgs and Mls should be 
updated at the same time. 
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Appendix B: Impulse Least Square Estimation 



The impulse waveform least square estimation is used for the linear canceling (as 
opposed to decision feed back canceling). It uses primarily the quiet tones in order to determine 
5 the impulse coefficients. It is similar to an over determined solution of set of equations 
(Generalized Least Square solution). The derivation is straightforward and may be applied to 
VCMT and DMT as well. Here we explain the VCMT case. 

Given the output of whitener 30 (Figure 3) is and the whitener 30 filter shape \^f(n), 
3 and the whitened noise power is a; we would like to find impulse (smeared hyf(n)) waveform hn 
}1 0 such that the following log likelihood is maximized: 



15 



LL = ^^^{c • exp(-i . ^\(y^ -hj/a 



) 



Max 



C 



■^\yr.-Kf) 



Assuming a/c, the impulse coefficients before whitener 30, are free numbers we get that h„ is 
h„=Y^aJ(n-k) 

k=0 

Inserting this equation into the Log Likelihood we can obtain the a^: using 

Using the "Orthogonality Principle", the error of optimal solution, projected on space spanned by 
^ aj^f (n-k), is zero we have: 



argmin 



a. 
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with F the following (2*s+L)*L matrix (for the VCMT case) 

F = [K V, ... F,_J 

L, The modeled impulse duration 

Vk, Defined as B in Appendix A the VCMT case (similar derivation is possible for the 
5 DMT case) 

Thus we obtain: 

Note that Fy is already available at the output of v^hitener match filter 32 (Figure 3). It was 
iio already grouped by FIFO 40 and used for impulse timing computation by Gram Schmidt module 
gl 42. Now we can reuse the output of FIFO 40 for impulse waveform estimation. We only need to 
'^1 multiply the output of FIFO 40 by matrix (FF)'\ a L*L matrix, with the product being the 
t. impulse waveform estimation. Also note we only need to perform this multiplication on the 
j timing estimated by Gram Schmidt module 42. Thus, matrix multiplication is performed at a low 
fis rate and has a small effect on complexity. Furthermore, the computation of (FT)'^ needs to be 
performed at a very low rate. Since F is dependent only on channel characteristics (Background 
noise and Impulse duration). 

The above description refers to impulse estimation matrix Mls which should be 
computed as follows: The system assesses its input noise spectrum, including both stationary 
20 noise sources and the Xmit signal itself Using the measured noise spectrum, it adapts the 
coefficients Gk so that the whitener filter 30, having impulse response f(n), will whiten the input 
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noise spectrum. Then, using f(n) we construct matrix F = [Fo ... F^.J with Vk as defined 
for basis B in Appendix A, i.e., the VCMT case. Finally we use F to compute MLs^^^'i^'^ • 

Appendix C: Using Apriori Impulse Power Knowledge 

In this section we will derive impulse timing estimation for VCMT assuming we have an 
apriori knowledge of impulse power. 

We model the input to impulse blanker/canceler (input to Noise whitener filter 30) as a 
combination of impulse noise and background noise. The impulse noise is modeled as the gating 
on and off (for the impulse duration) of a white Gaussian noise source. A simplified discrete 

L-\ 

version of this model is ^aj^S{n - k) , alternately (and more accurately) we could have modeled 

jt=0 

the impulse continuously using a Fourier series. Thus, are assumed Gaussian with power oi. 
The background noise is modeled as a stationary colored Gaussian noise composed of Xmit 
signal and other noise sources. 

Noise whitener 30 (Figure 3) is used to whiten the background noise prior to impulse 
detection. Blocks after the whitener need to detect impulse existence using whitener output y(n). 
As before the log likelihood ratio LLRdetect is used for detecting impulses. The LLR is 
maximized with respect to h(n), which is the impulse waveform (in the time domain), thus 
performing maximum likelihood estimation for impulse coefficients. 
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p{y{t-s:t^s)IH,) 



With, y, Signal input to whitener match filter 
Hi, the hypothesis impulse has hit at t 
Ho, the hypothesis impulse has not hit at t 
h, the impulse waveform (time domain) 

s, the LLR "support " i.e., half the length of y on which LLR is computed 
However, we cannot declare Hi for all thresholds exceeding LLRdetects, Typically, 
since around the true timing of the impulse there would be a neighborhood of high LLRdetect^, 
we would like to suppress the additional false detections. As with previous cases, we can 
suppress false alarms by looking for local neighborhood maximas (i.e., mountain peaks) of 

LLRoETECT^- 

The background noise is whitened by Noise Whitener 30, which outputs a a power white 
Gaussian noise. However, the impulse is also affected by noise whitener 30. We have modeled 

the impulse as^<3^<^(w - k) , with Ok being gaussian with power a\ . Thus, after passing through 

L-\ 

whitener 30 we get a smeared impulse version = ^aj^f(n - k) . Using all these we obtain: 



Max 

LLRj^gjEQj (t) = ^ Ln 



C-exp(-^-X 








2 

) 


C-expC-i-X: 


y 1 (J 

n 


2 

) 



Max 
h 



''C^{^\y.t-Y\yn-Kt-i:\a,-crla, 
To efficiently calculate this maximization we use the following approach: In previous derivations 
we were working in a linear vector space of dimension N=dim(y) (i.e., having the same 
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dimension as y), here we would like to expand this space so it additionally contains the akS as its 
coordinates. This enables the formulation of the maximization above in terms of vector 
projection in the expanded linear space. 

To re-formulate the maximization we first have to define vectors {Vk}k=i..l - Vector Vk 
is a column vector composed of two parts. The upper part has dimension N and the lower part 
has dimension L. The upper part is constructed by shifting down K locations and padding 
with zeroes. The lower part is constructed by putting afo\ in location K and zero in all other L-1 
locations. The following is an illustration of Vk- 
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In addition we need to append y with L zeros. Using the extended y and Vk we can re-formulate 
the maximization: 
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The re-formulated maximization is identical to the original one. The difference is that the 
related components are generated by the vector extensions. Using this vector formulation it is 
o obvious we need to minimize the second addend. The second addend is simply the error of 
M 5 approximating y using Vk's. Moreover it is known from linear algebra that the projection of y on 
a subspace spanned by Vk is the best "approximation" existing in the subspace to y. Thus, to find 
the maxima we should project the extended y on the subspace spanned by Vk. This is identical to 
fl cases of impulse detection without apriori knowledge (^pendix A) but in the extended space. 

Continuing similarly to previous derivations wq obtain Score(t) = ||X^r^f • ^^^^ 

10 to compute the norm of the projection of y on the subspace spanned by {F^}. This can be 

performed using a matrix 'A' that orthogonalizes the basis {Fj^ \k = 1 ..i} i.e. 

(VAy(VA) = I 



with 



v = [v, V, ... 

L, The modeled impulse duration in samples 
Then 
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Score(,t) = \\^al''%f =\\(VAyyf =\\A'iV'y)\ 



Such a triangular matrix A may by computed using Cholesky factorization on V'V . The fact that 
A is triangular further reduces the complexity. 
Since y was extended with zeros it turns out that 



s, the support of f(n) (i.e., length of f(n) is 2*s+l) 
L, The modeled impulse duration in samples 

However (F'y) are L consecutive samples of the whitener match filter output 32 (Figure 3). So 
we only need to additionally apply A ' and then square and sum up. Then we can continue as 
before (Appendix A). The only difference vs. appendix A is a different computation for matrix 
A, i.e., Cholesky factorization on FTinstead ofF'F. 

The description above does not refer directly to matrix A, instead it refers to matrix 
Mgs=A', which is applied by Gram Schmidt module 42 in Figure 3. To summarize, the steps for 
computing Mgs for this apriori impulse power case are as follows. As in the VCMT case, the 
matrix A may be computed periodically by a low rate process. 



(V'y)=(F'y). 



with 



F = [fo fr - /.-J 
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This periodic Gram Schmidt matrix computation tasic is comprised of the following steps: 
The system assesses its input noise spectrum, including both stationary noise soiirces and the 
Xmit signal itself. Using the measured noise spectrum, it adapts the coefficients Gk so that the 
whitener filter 30, having impulse response f(n), will whiten the input noise spectrum. Then 
using f(n) we build the matrix F = [Vq F, ... F^.J with Vk defined above. Then using, for 
example, Cholesky factorization, we find Mqs such that McsFFMcs'=I (since 
McsF'FMcs'= {FMgs)\FMc;s) = !)■ Note that the whitener filter f(n) and the whitener match 
filter P(-n) and matrixes Mqs and Mls should be updated at the same time. 
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